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1. Introduction 

Let n be a large integer, and Mn be a random n by n matrix whose entries are 
i.i.d. Bernoulli random variables (each entry is ±1 with probability 1/2). In this 
paper we consider the well known question of estimating P„ :— P(det(M„) — 0), 
the probability that M„ is singular. 

By considering the event that two columns or two rows of Af„ are equal (up to 
sign), it is clear that 



P„> (l + o(l))n22i-". 

Here and in the sequel we use the asymptotic notation under the assumption that 
n tends to infinity. It has been conjectured by many researchers that in fact 



Conjecture 1.1. 



P„ = (l + o(l))n22i-" = (1 + 0(1))' 



In a breakthrough paper Kahn, Komlos and Szemeredi proved that 



(1.1) P„ = 0(.999"). 

In a recent paper I17| we improved this bound slightly to 0(.958"). The main result 
of this paper is the following more noticeable improvement. 

Theorem 1.2. 

Pn<{\ + o{i)y\ 
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The proof of this theorem uses several ideas from [TTl [17] . The key new ingredient 
is the appUcations of Freiman type inverse theorems and other tools from additive 
combinatorics. 

In the next section, we present the initial steps of the proof of Theorem ll.2l These 
steps will reveal the origin of the bound (| + o(l))" and also provide an approach 
to the conjectured bound (1 + o(l))n^2^~". 



2. Reduction to medium combinatorial dimension 

We begin the proof of Theorem 1 1.21 We first make a convenient (though somewhat 
artificial) finite field reduction. Observe that any n x n matrix with entries ±1 
can have determinant at most n"/^ (since the magnitude of the determinant equals 
the volume of a n-dimensional parallelopiped, all of whose edges have length ^/n); 
this is also known as Hadamard's inequality. Thus if we introduce a finite field 
F := Z/pZ of prime order 

(2.1) |F|=p>n"/2, 

then Pn is also the probability that a random n x n matrix with entries ±1 (now 
thought of as elements of F) is singular in the finite field F. Henceforth we fix the 
finite field F, and shall work over the finite field F instead of over R. In particular, 
linear algebra terminology such as "dimension" , "rank" , "linearly independent" , 
"subspace" , "span" , etc. will now be with respect to the field F. One advantage 
of this discretized setting is that all collections of objects under consideration will 
now automatically be finite, and one can perform linear change of variables over 
F without having to worry about Jacobian factors. Note that while one could use 
Bertrand's postulate to fix \F\ to be comparable to n"/^, we will not need to do so 
in this paper. Indeed it will be more convenient to take \F\ to be extremely large 
(e.g. larger than exp(exp(Cn))) to avoid any "torsion" issues. 

Now let { — 1, 1}" C F" be the discrete unit cube in F". We let X be the random 
variable taking values in {—1, 1}" which is distributed uniformly on this cube (thus 
each element of {—1,1}" is attained with probability 2~"). Let Xi,...,Xn G 
{—1,1} be n independent samples of X. Then 

Pn := PiXi, . . . , Xn linearly dependent). 

For each linear subspace V of F", let Ay denote the event that Xi, . . . , X„ span 
V. Let us call a space V non-trivial if it is spanned by the set V n {—1, 1}". Note 
that P{Av) ^ if and only if V is non-trivial. Since every collection of n linearly 
dependent vectors in F" will span exactly one proper subspace V of F" , we have 



(2.2) F„= 

V a proper non-trivial subspace of F" 
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In [17l Lemma 5.1] (see also [11]), we showed that the dominant contribution to 
this sum came from the hyperplanes: 

V a non-trivial hyperplanc in _F" 
Thus it will suffice to show that 

J2 P{Av)<{3/4 + o{l)r. 

V a non-trivial hyperplane in F" 

We remark that the arguments below also extend to control the lower-dimensional 
spaces directly, but it will be somewhat simpler technically to work just with hy- 
perplanes. 

As in [llj , the next step is to partition the non-trivial hyperplanes V into a number 
of classes, depending on a quantity which we shall call the combinatorial dimension. 

Definition 2.1 (Combinatorial dimension). Let D := {d± E Z/n : 1 < d± < n}. 

For any d± d D, we define the combinatorial Grassmannian Gr(c?±) to be the set 
of all non-trivial hyperplanes V in F" with 

(2.3) 2'^±-i/" < |yn{-i,i}"| < 2'*±. 

We will refer to d± as the combinatorial dimension of V . 

Remark 2.2. Following [11 , we have gradated the combinatorial dimension in steps 
of 1/n rather than steps of 1. This is because we will need to raise probabilities 
such as P(X S V) to the power n, and thus errors of 2'^'-'^/"^ in will be acceptable 
while errors of 20W can be more problematic. This of course increases the number 
of possible combinatorial dimensions from 0{n) to O(n^), but this will have a 
negligible impact on our analysis. 

It thus suffices to show that 

(2.4) Y. E p{Ay)<{-^+o{i)r. 

d±eD veGr{d±) 

It is therefore of interest to understand the size of the combinatorial Grassman- 
nians Gr(c?±) and of the probability of the events Ay for hyperplanes V in those 
Gr assmannians . 

There are two easy cases, one when d± is fairly small and one where d± is fairly 
large, both of which were treated in [111, and which we present below. 

Lemma 2.3 (Small combinatorial dimension estimate). [11] Let < a < 1 be 
arbitrary. Then 

Proof. Observe that ii Xi, . . . , Xn span V, then there are n — 1 vectors among the 
Xi which already span V. By symmetry, we thus have 
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(2.5) P{Av) = P{Xi, ...,X„ span V) < nP{Xi, . . .,Xn-i span V)P{X e V). 

On the other hand, if F G Gr(d±) and 2''±-" < a", then P(X e < a" thanks 
to (jO)) . Thus we have 

P(Av) < na"P(Xi, . . . , X„_i span F). 

Since Xi, . . . , X„_i can span at most one space V, the claim foUows. □ 

Lemma 2.4 (Large combinatorial dimension estimate). [11] We have 
5^ 5] P(Av)<(l + o(l))n22-". 

£i±G_D:2'^±-">100/VTr VeGT{d±) 

Proof. (Sketch) For this proof it is convenient to work back in Euclidean space 
R" instead of in the finite field model F"; observe that one can identify non- 
trivial hyperplanes in i^" with non-trivial hyperplanes in R" without affecting the 
combinatorial dimension. Let V C R" be a hyperplane in Gi{d±) with 2''±~" > 
i^. Then by we have 

P{X eV)> 50/Vn 

(say). Since V is spanned by elements yn{ — 1,1}" which lie in the integer lattice 
Z", the orthogonal complement V-^ C R" contains at least one integer vector 
a = (oi, . . . , a„) e Z"\{0} which is not identically zero. In particular we see that 

P{aiXi + ... + a„X„) > 50/ v^. 

But by Erdos's Littlewood-Offord inequality (see [5]) this forces at most n/2 (say) 
of the coefficients Oi, . . . , a„ to be non-zero, if ri is sufficiently large. Thus 

< P(a • Xi = . . . = a ■ Xn — for some a G fi) 

where $7 is the set of those vectors a = (ai, . . . , a„) £ Z"\{0} which have at most 
n/2 non-zero entries. But in [11', Section 3.1] it is shown that 

P(a • Xi = . . . = a • X„ = for some a e n) < {1 + o(l))n22"" 

and the claim follows. (In fact, in the estimate in 11] one can permit as many as 
n — 3 log2 n of the co-efRcients ai, . . . , a„ to be non-zero). □ 

We will supplement these two easy lemmas with the following, somewhat more 
difficult, result. 

Proposition 2.5 (Medium combinatorial dimension estimate). Let < eq 1, 

and let d± e D be such that (| -h 2eo)" < 2"^*-" < i^. Then we have 

^ P{Av)<o{ir, 

veGr{d±) 

where the rate of decay in the o(l) quantity depends on eg (but not on d±). 
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Note that D has cardinahty \D\ = 0{n^). Thus if we combine this proposition 
with Lemma [2.31 (with a :— j + 2eo) and Lemma [2.41 we see that we can bound 
the left-hand side of ((^ by 

+ 2eo)" + n^o{ir + (1 + o{l))n^2-" = + 2eo + o(l))". 
Since eo is arbitrary, Theorem 11.21 foUows . 

It thus suffices to prove Proposition 12.51 This we shaU do in later sections, but 
for now let us just remark that this argument suggests a way to obtain Conjecture 
11.11 in full. A comparison of the three bounds in Lemma 12.31 Lemma 12.41 a-nd 
Proposition 12.51 reveals that it is the estimate in Lemma 12.31 which is by far the 
most inefficient. Indeed, from the other two lemmas we now see that the conjecture 
Pji = (5 + 0(1))" is equivalent to proving that 

J2 E p(^v-)<(^ + o(i))" 

d±GD:2''±-"<(f +2eo)" VeGi:{dj,) 

for at least one value of eo > 0- This bound is sharp (except for the o(l) factor) as 
can be seen by considering the event that two of the Xi are equal up to sign. On 
the other hand, if we let B denote the event that aiXi + . . . + a„X„ for some 
tti, . . . , a„ G Z"\{0} with at most n — 31og2 n non-zero entries, it is shown in [111 
Section 3.1] that P{B) = (1 -I- o(l))n^2^" (this is basically the transpose of Lemma 
I2.4p . Thus it would suffice to show 

E E PiAv\B) = {l+oil)r. 

d±eD:2''±-"<(f +2eo)" veGr(d±) 

Based on Proposition 12.51 we may tentatively conjecture that in fact we have the 
stronger statement 

(2.6) E E nAv\B)^om 

rf±e_D:2''±-"<(|+2eo)" veGr{d±) 

this for instance would even imply the stronger conjecture in Conjecture 11.11 But 
our Fourier-based methods seem to hit a natural limit at 2'^±^" ~ (3/4)" and so 
are unable to obtain any further progress towards (|2.6p . 

3. The general approach 

We now informally discuss the proof of Proposition 12.51 the rigorous proof will 
begin in Section 01 We start with the trivial bound 

(3.1) E ^ 1 

V<£GT{d±) 

that arises simply because any vectors Xi, . . . , X„ can span at most one space V. 
To improve upon this trivial bound, the key innovation in [TT] is to replace X by 
another random variable Y which tends to be more concentrated on subspaces V 
than X is. Roughly speaking, one seeks the property 

(3.2) P{X eV)< cP{Y e V) 
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for some absolute constant < c < 1 and for all (or almost all) subspaces V g 
Gr((i±). From this property, one expects (heuristically, at least) 

(3.3) PiAv) = P(Xi, . . . ,X„ span V) < c"P(yi, . . . , r„ span V), 

where Yi, . . . ,Yn are iid samples of Y, and then by applying the trivial bound (j3.1l) 
with Y instead of X, we would then obtain a bound of the form J2veGi:{d±) '^i^v) — 
c", at least in principle. Clearly, it will be desirable to make c as small as possible; 
if we can make c arbitrarily small, we will have established Proposition [^3] 

The random variable Y can be described as follows. Let < n < 1 be a small 
absolute constant (in [11] the value n = igge"-^/^"^ was chosen), and let ry^'^^ be a 
random variable taking values in {—1, 0, 1} C F which equals with probability 1 — 
H and equals +1 or —1 with probability /i/2 each. Then let Y := ■ ■ ■ , Tyri^) € 

i^", where T]f'\ ■ ■ ■ are iid samples of 77''^^. By using some Fourier-analytic 
arguments of Halasz, a bound of the form 

P(X <eV)< Cy/JIP{Y e V) 

was shown in |11| . where C was an absolute constant (independent of fi), and V 
was a hyperplane which was non-degenerate in the sense that its combinatorial 
dimension was not too close to n. For /i sufficiently small, one then obtains (|3.2[) 
for some < c < 1 , although one cannot make c arbitrarily small without shrinking 
fi also. 

There are however some technical difficulties with this approach, arising when one 
tries to pass from (|3.2p to (|3.3|) . The first problem is that the random variable Y, 
when conditioned on the event Y € V, may concentrate on a lower dimensional 
subspace on V, making it unlikely that Yi, . . . ,Y„ will span V. In particular, Y 
has a probability of (1 — /i)" of being the zero vector, which basically means that 
one cannot hope to exploit (|3.2p in any non-trivial way once P{X G V) < {1 — n)"^ . 
However, in this case V has very low combinatorial dimension and Lemma 12.31 
already gives an exponential gain. 

Even when (1 — /i)" < P(X gV)<1, it turns out that it is still not particularly 
easy to obtain (|3.3p . but one can obtain an acceptable substitute for this estimate 
by only replacing some of the Xj by Yj. Specifically, one can try to obtain an 
estimate roughly of the form 

(3.4) P(Xi, . . . ,X„ span V) < c"P(ri, . . . ,i;„,Xi, . . . ,X„_„ span V) 

where m is equal to a suitably small multiple of n (we will eventually take m ~ 
71/100). Strictly speaking, we will also have to absorb an additional "entropy" loss 
of (^) for technical reasons, though as we will be taking c arbitrarily small, this 
loss will ultimately be irrelevant. 

The above approach (with some minor modifications) was carried out rigorously in 
[TT] to give the bound P„ = 0(.999") which has been improved slightly to 0(.939") 
in [17j . There are two main reasons why the final gain in the base was relatively 
small. Firstly, the chosen value of /x was small (so the n(l — /z)" error was sizeable), 
and secondly the value of c obtained was relatively large (so the gain of c" or c^^"'''^" 
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was relatively weak). Unfortunately, increasing /i also causes c to increase, and so 
even after optimizing /i and c one falls well short of the conjectured bound. 

In this paper we partially resolve this problem as follows. To reduce all the other 
losses to (| + 2eo)" for some small Eq, we increase /i up to 1/4 — eo/100, at which 
point the arguments of Halasz and [11], [17] give (|3.2p with c = 1. The value 1/4 
for fj, is optimal as it is the largest number satisfying the pointwise inequality 

I cos(a;)| < (1 — ^) + ^cos(2a;) for all x e R, 

which is the Fourier-analytic analogue of (|3.2p (with c = 1). At first glance, the 
fact that c — 1 seems to remove any utility to (13. 2p . as the above argument relied 
on obtaining gains of the form c" or c^^~'^^". However, we can proceed further 
by subdividing the collection of hyperplanes Gi{d±) into two classes, namely the 
unexceptional spaces V for which 

P(X eV)< eiP{Y e V) 

for some small constant < ei ^ 1 to be chosen later (it will be much smaller than 
Sq), and the exceptional spaces for which 

(3.5) eiP{Y eV)< P{X eV) < P{Y e V). 

The contribution of the unexceptional spaces can be dealt with by the preceding 
arguments to obtain a very small contribution (at most (5" for any fixed S > given 
that we set ei = £1(7, 6) suitably small), so it remains to consider the exceptional 
spaces V. The key new observation (which uses the Fourier analytic methods of 
Halasz and [11]) is that the condition (j3.5|) can be viewed as a statement that 
the "spectrum" of V (which we will define later) has "small doubling constant". 
Once one sees this, one can apply theorems from inverse additive number theory (in 
particular a variant Freiman's theorem) to obtain some strong structural control on 
V (indeed, we obtain what is essentially a complete description of the exceptional 
spaces V, up to constants depending on Eq and ei). This then allows us to obtain 
a fairly accurate bound for the total number of exceptional spaces V , which can 
then be used to estimate their contribution to (|2.2p satisfactorily. 



4. Reduction to exceptional spaces 



We now rigorously carry out the strategy outlined in Section (3] We first pick the 
parameter /i as 

and let Y £ {—1,0, 1}" C F"^ be the random variables defined in Section |3| using 
using this value of fi. We let Yi, . . . , Y„ be iid samples of Y , independent of the 
previous samples Xi, . . . , X„ of X. Thus we may write 

n n 

(4.2) X = Y,^e,: Y = Y,^f\, 

where the r]j^^ are iid samples of the random variable i]^^"' introduced in the previous 
section, rj'^j^^ are iid random signs, and ei, . . . , Cn is the standard basis of F". 
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Next, we introduce a small parameter ei > (which will be much smaller than eg)- 
Let us call a space V £ Gr(d±) exceptional if 

(4.3) P(X eV)> eiP(r e V) 
and unexceptional if 

(4.4) P{X eV)< eiP{Y e V). 

As the terminology suggests, the exceptional spaces will turn out to be relatively 
rare compared to the unexceptional spaces, as will be seen by comparing Lemma 
14.11 and Lemma [4?2l below. 

Proposition 12 . 51 is now a consequence of the following two sub-lemmas. 
Lemma 4.1 (Unexceptional space estimate). We have 

(4.5) Y ^ 2°(")2"ef 
veGT(d±):V unexceptional 

where the decay rate in the o{n) term can depend on eojCi- 
Lemma 4.2 (Exceptional space estimate). We have 

J2 Pi^v) < n-t+°(«) 

veGT{d±}:V exceptional 
where the decay rate in the o{n) term can depend on eo,ei. 

Indeed, upon combining Lemma |4. II and Lemma l4.2l we obtain 

Y P{Av) < 2°(")2"ef + „-»/2+°(»), 
yeGr(£i±) 

which implies Proposition 12.51 since £i can be chosen arbitrarily small. 

It remains to prove Lemma 14.11 and Lemma 14.21 We prove Lemma 14.11 which is 
simpler in this section, and leave the more difficult Lemma 14.21 to later sections. 

Proof of Lemma \4-l\ We adapt some arguments form [TT] , [T7] ; the estimates here 
are somewhat cruder (and simpler) than in those papers, because we have the addi- 
tional factor of El which can absorb several losses which can arise in this argument. 
We begin with a lemma. 

Lemma 4.3 (Weighted Odlyzko lemma). [11] Let < d < n. If W is any d- 

dimensional subspace of , then 

p(r ^w)<{i- fiY'-'^. 

Proof. Write Y = (771, .. . ,??„). Then there exists some d-tuple (ji, . . . ,jd) of co- 
ordinates which determine the element of W; this means that once rjj-^, . . . , rjj^ are 
selected, there is only one possible choice for each of the remaining n—d independent 
random variables rji if one wishes Y to lie in W . Since any given value is attained 
by an rji with probability at most 1 — the claim follows. □ 
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Let TO be the nearest integer to £o"-/100. In addition to the random variables 
Xi, . . . , Xn, we create more random variables Yi, . . . , Ym which have the same dis- 
tribution as Y , and which are independent of each other and of Xi, . . . , X„. 

Using Lemma 14.31 we have 

Lemma 4.4. Let V G Gr{d±) be unexceptional, and let Bv,m denote the event that 
Yi, . . . , Yjn are linearly independent and lie in V . Then 

PiBv^m) > 2-°(")(2''±-"/ei)™. 
Proof. Using Bayes' identity, we can factorize 

rn 

P{Bv,m) =Y[P{Bv,^\Bv.^-l) 

i=l 

where Bva is the event that Yi, . . . ,Yi are linearly independent and lie in V. Let 
Wi be the linear subspace of F" spanned by Yi , . . . , . Then conditioning on any 
fixed value of Yi, . . . , Yi_i in the event By.i-i, we have 

P{Bv,^\Bv^^^l) = P(Y eV)- P(Y e W,), 

since Yj is independent of Yi, . . . , Yi_i and has the same distribution as i. By (j4.4p . 
(|2.3p we have 

P(Y eV)> —P{x ev)> J_2-i/»2'^±-". 

On the other hand, since Wi has dimension at most m, we see from Lemma 14. 3[ 
(|4l|) that 

P(Y e W.) < (1 - m)"-" - + ^)""™- 

Since to = £on/100 + 0(1), and (| + 2eo)" < 2''±^" by hypothesis, we thus see that 

P(Y e W,) = 0{l/n)P{Y e V) 
(for instance) if n is sufficiently large depending on Eq. Thus we have 

P{Bv.^\Bv,^-l) > (1 - 0(l/n))2'^±-"/£i. 
Multiplying this together for all 1 < i < to, the claim follows. □ 

To apply this lemma, observe that Ay and Bv,m are clearly independent, as they 
involve independent sets of random variables. Thus we have 

PiAv) < 2°(")(2'*±-"/£i)-™P(yly A By). 

Now let Xi, . . . , Xn, Yi, . . . , Yto be in the event Ay A By. Since Yi, . . . , Y™ are 
linearly independent in and Xi, . . . , X„ span V, we see that there exist n — m 
vectors in Xi, . . . , Xn which, together with Yi, . . . , Y^, span V. The number of 
possibilities for such vectors is which we can crudely bound by 2". By 

symmetry we thus have 

P{Ay) < 2°(")(2'*±-"/ei)-"2"P(Cy), 

where Cy is the event that Yi, . . . , 1^, Xi, . . . , X„_m span and that Xn-m+ii ■ ■ ■ , X, 
lie in V. By independence we then have 

PiAy) < 2°(")(2'^±-"/£i)-"2"P(Yi, . . . , Y™,Xi, . . . ,X„_,„ spanF)P(X e F)". 
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Applying (12. 3p we conclude 

P{Av) < 2°(")(2'*±-"/£i)-'"2"P(yi, . . . , r„, Xi, . . . , spany)(2'^±-")". 

Summing this over V, and observing that each collection of vectors Yi , . . . , Ym , Xi , . . . , X, 
can only span a single space V, we conclude 

PiAv) < 2°(")(2''±""/ei)"™2"(2''±"")™ 

yeGr(d±):y unexceptional 
and the claim follows from the choice of m. □ 



5. Structure of exceptional subspaces 

To conclude the proof of Theorem II. 2|, the only remaining task is to prove Lemma 
K2\ Observe from that 

P{Av) < P{X £ V)" < 2-"("-''±) 

for all V E Gr{d±). Thus it will suffice to show that 

(5.1) \{V e Gr(d±) : V exceptional}] < „-t+°(")2"("-''±). 

To do this we require a certain structural theorem about exceptional subspaces, 
which is the main novelty in our argument, and the one which requires tools from 
inverse additive number theory such as Freiman's theorem. 

We need some notation before we can describe the structure theorem. 

Definition 5.1 (Progressions). Let Mi, . . . , Mr be integers and let a, ui, . . . , be 

non-zero elements of F. The set 

P := {a + mivi + . . . + rn^Wr : -Mj /2 < mj < M, /2 for all 1 < j < r} 

is called a generalized arithmetic progression of rank r; we say that the progression 
is symmetric if a = 0. We say that P is proper if the map 

(toi, . . . , mr) I—!- miVi + . . . + rrirVr 

is injective when —Mj/2 < nrij < Mj/2. If P is proper and symmetric, we define 
the P-norm \\v\\p of a point v = miVi + . . . + m^Wr in P by the formula 

Let V e Gr(d±) be an exceptional space, with a representation of the form 

(5.2) V = {(xi, . . . , a;„) e F" : xiai + ... + a:„a„ = 0} 

for some elements ai, . . . , a„ € F . We shall refer to ai, . . . , a„ as the defining 
co-ordinates for V . 
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Theorem 5.2 (Structure theorem). There is a constant C — C(eo,ei) such that 
the following holds. Let V be an exceptional hyperplane in Gr(d±) and ai, . . . ,an 
be its defining co-ordinates. Then there exist integers 

(5.3) 1 < r < C 
and Ml , . . . , Mr > 1 with the volume bound 

(5.4) Ml... Mr < C2"-''± 

and non-zero elements vi, . . . ,Vr G F such that the following holds. 

• ft) (Defining coordinates lie in a progression) The symmetric generalized 
arithmetic progression 

P := {mivi + . . . + nirVr ■ -Mj/2 < rUj < Mj/2 for all 1 < j < r} 

is proper and contains all the Oj . 

• fti) (Bounded norm) The a; have small P-norm: 

n 

(5.5) $]||a,||l.<C 

• (Hi) (Rational commensurability) The set {vi, . . . , Vr}U{ai, . . . , a„} is con- 
tained in the set 

(5.6) {^z;i:p,gGZ;g^0;b|,|9|<n°(")}. 

q 

Remark 5.3. The condition (i) asserts that the defining co-ordinates ai, . . . , an oiV 
lie in a fairly small generalized arithmetic progression with bounded rank 0(1). The 
condition (|5.6p asserts, furthermore, that this progression is contained in a (rank 
one) arithmetic progression of length n°("l Thus we have placed {ai, . . . , a„} inside 
two progressions, one of small size but moderately large rank, and one of rank one 
but of fairly huge size (but the dimensions are still smaller than n"/^ or \F\). 

Remark 5.4. The structure theorem is fairly efficient; if oi, . . . ,a„ obey the con- 
clusions of the theorem, then from (|5.5p and the theory of random walks (taking 
advantage of the boundedness of r, which will play the role of dimension) we ex- 
pect the random variables r][^^ai -I- ... -I- r][^^an and rj[^^ai -I- ... -I- rlf"^ On to be 
distributed fairly uniformly on P and decay rapidly away from P, and so we expect 
the probabilities P(X e y) and P(y G V) to be comparable (up to a constant). 

We shall prove the structure theorem in later sections. For the remainder of this 
section, we show how the structure theorem can be used to prove Lemma 14.21 In 
the sequel all of the o() factors are allowed to depend on eo, ei- 

Let V £ Gr{d±) be an exceptional hyperplane. Then V has |F| — 1 representations 
of the form (|5.2p . one for each non-zero normal vector of V. Let us call an n-tuple 
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(oi, . . . , ttn) of elements in F exceptional if it obeys the conclusions of the structure 
theorem for at least one progression P, thus we have 

\{V e Gr(d±) : V exceptional}! ^tttt 

\F\ - 1 

|{(ai, . . . ,a„) eF" : (ai, . . . ,a„) exceptional}], 
and so it now suffices to show 



|{(ai, . . . , a„) e : (ai, . . . , a„) exceptional}] 



We now need to count the number of exceptional (oi, . . . ,a„). We first observe 
that we may fix the parameter r in the conclusion of the structure theorem, since 
the number of r is at most C = n"*^"-* . Similarly we may fix each Mi , . . . , Mr , since 
the total number of choices here is at most (C2"'~''±)'' = n°^"^ 

The number of possible choices for vi is at most Once this vector is selected, we 
see from (|5.6I) that there are at most n"^"-' possible choices for each of the remaining 
r—1 = 0(1) vectors. Putting the estimates together, we can conclude that the total 
number of possible vectors {vi, . . . ,Vr) which could be chosen is at most 

It now suffices to show that, for each fixed choice of r. Mi, . . . , Mr, vi, . . . ,Vr (which 
in particular fixes P), that the number of possible exceptional (ai, . . . , a„) is at most 
^o{n)^-n/22n(n-d±) ^ Wc are going to use the bound on the P-norms of the Ui. Our 
task is to show that for any constant C 



(5.7) {(ai,...,a„) e P" : ^||aj|||, < C} < n°(")n-»/22»("-rf±). 

We shall use Gaussian-type methods. To start, notice that 



{(ai 



<C} 



< n 



o(n) 



exp(-n^| 



(ai 



3 = i 



The right hand side can be rewritten as 



n°(")(^exp(-n||a|||)) = n°(") ( J] E e'""?/*^^ 

aeP j=^\m.j\<Mj 
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Using the elementary bound 

\m\<M 

we can bound the right hand side from above by 



l<j<r ^ 

Since r = 0(1) and Mi, . . . , > 1, the product ni<j<r(l + 

can be bounded 

(rather crudely) by 0(1 + nT^/'^Mi . . . Mr). This leads us to the following bound 

n°(")(l + n-i/2A/i...M,.)". 

Now we use the information (see (|5.4p in Structure Theorem) that Mi . . . Mr is at 
most C2"~''± . The bound becomes 

as desired, thanks to the hypothesis 2'^±^" < 100/^/n- 

This concludes the proof of Theorem 11.21 except for the Structure Theorem, which 
we turn to next. This proof requires a heavy use of tools from additive combina- 
torics. We are going to introduce these tools in the next section. 



6. Generalized arithmetic progressions and inverse theorems 

In this section we review some notation and results from additive combinatorics, 
and briefly sketch how they will be used to prove Theorem [521 The rigorous proof 
of this theorem will begin in Section [71 

Let A and B be finite subsets of G. By A + B := {a + b : a £ A,b £ B} we denote 
the set of all elements of G which can be represented as the sum of an element from 
A and an element from B; A ~ B :— {a — b : a G A,b G B} is defined similarly. 
Moreover, for any positive integer I, we recursively define lA := (I — 1)A + A, thus 
I A is the set of ^-fold sums in A (allowing repetition) . 

The doubling constant of a finite non-empty set ^4 C _F is defined to be the the ratio 
|A + It is easy to see that if A is a dense subset (with constant density 6) of 
a proper generalized arithmetic progression of constant rank r, then the doubling 
number of A is bounded from above by a constant depending on S and r (in fact it 
is bounded by 6^^ 2^). 
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In the mid 1970s, Freiman [6] proved a remarkable result in the converse direction, 
which asserts that being a dense subset of a proper arithmetic progression of con- 
stant rank r is the only reason for a finite set A of integers to have small doubling 
constant: 

Theorem 6.1 (Freiman's theorem). For any constant C there are constant 
r and 5 such that the following holds. For any finite set A of integers such that 
\A + A\ < C\A\, there is a proper arithmetic progression P of rank r such that 
AC P and \A\/\P\ > 6. 

Freiman's proof was rather complex. A cleaner proof, based on Freiman's ideas, is 
presented by Bilu in [1] . Ruzsa [Ej gave a different and quite short proof; this was 
then refined by Chang [4] . There are explicit bounds known on the values of r and 
S in terms of C, but for our application these bounds will only influence the o(l) 
term in our final result and so we will not keep track of them here. 

Freiman's theorem was initially phrased in the integers Z, but it can easily be 
transferred to a finite field F of prime order if F is sufficiently large: 

Theorem 6.2. For any constant C there are constant r and S such that the follow- 
ing holds. Let F be a finite field of prime order, and let A be a non-empty subset 
of F such that \A + A\ < C'\A\. Then, if \F\ is sufficiently large depending on 
\A\, there is a generalized arithmetic progression P of rank r such that A C P and 
\A\/\P\>S. 

This theorem follows from Freiman's original theorem and "rectification" theorems 
such as [H Theorem 3.1] (see also [7]) to map A via a Freiman isomorphism to 
the the integers Z (using the hypothesis that F is large depending on A; \F\ > 
exp(C|A|) would do). It is also a special case of the version of Freiman's theorem 
established in [8] for an arbitrary abelian group. In fact, that result allows one to 
remove the hypothesis that |i^| is sufficiently large depending on \A\. 

A generalized arithmetic progression is not always proper. We can, however, make 
this assumption whenever we like (at the cost of a constant factor) thanks to the 
following lemma. 

Lemma 6.3 (Progressions lie inside proper progressions). There is a constant C 
such that the following holds. Let P be a generalized arithmetic progression of rank 
r in an abelian group G. Suppose that every non-zero element of G has order at 
least r'~^^ \P\. Then there exists a proper arithmetic progression Q of rank at most 
r containing P and 

|g| <r^'-V|. 

The proof of this lemma arises purely from the geometry of numbers (in particular, 
Minkowski's second theorem) and is independent from the rest of this paper. In 
order not to distract the reader, we defer this rather technical proof in the Appendix. 



ON THE SINGULARITY PROBABILITY OF RANDOM BERNOULLI MATRICES 15 



Finally, we are going to need the following lemma, based on a covering argument 
of Ruzsa [16]. We say that a set A is symmetric if the set —A := {—a, a E A} is 
equal to A. 

Lemma 6.4 (Sum set estimates). Let A be a symmetric finite subset of an abelian 
group G such that \AA\ < C\A\ for some C > 1. Then for any fc > 4 




Proof. We can assume that k > 4. Consider the sets rj + A as ri ranges inside 
3A. Each set has cardinality \A\, and is contained inside AA. Thus we may find 
a maximal disjoint collection {£, + A : rj E X} where X C 3A has cardinality 
\X\ < \AA\/\A\ < C. Then for any ^ G 3A there exists rj E X such that 77 + A 
intersects ^ + A, otherwise this would contradict maximality. But this implies that 
S,eX + A-A = 2A + X. Thus we have 

A + 2A^'iA<Z2A + X. 

Iterating this we obtain 

kA(l2A+{k- 2)X 

for aU fc > 2. Thus 

\kA\ < \2A\\{k - 2)X\ < C\A\{k - 2)X\. 
To conclude the proof, notice that for any I 

□ 

In fact one can replace the condition \4:A\ < C\A\ by a weaker condition |3yl| < 
C\A\. 

We are going to use Theorem 16.21 to derive a structural property of the defining 
vectors {ai, . . . , a„} of V in Structure Theorem. To be more precise, we are going 
to apply Theorem 16.21 to a set A with small doubling constant which contains 
the majority of the defining vectors {oi, . . . ,a„}. Thus we obtain a generalized 
arithmetic progression P containing A (and with it most of the ai). It will be easy 
to extend P to contain all a.; without violating the properties we care about. 

The application of Theorem 16.21 is not immediate. In the next section, we shall 
provide a Fourier-analytic argument which leads to the definition of A. The starting 
point of this argument is based on the method which is motivated by an 
observation of Halasz [9] ; this method shall be the focus of the next section. 
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7. HALASZ-TYPE ARGUMENTS 



Let V S Gr{d±) be an exceptional space with the representation (|5.2p . In order 
to obtain the desired structural control on the defining co-ordinates oi, . . . , a„, we 
shall first use Fourier analysis to first gain some strong control on the "spectrum" 
A of ai, . . . , a„ (which we will define in (j7.4|) ). and then apply the inverse Fourier 
transform to recover information about ai, . . . , a„. 



We turn to the details. By Fourier expansion we have 

^{'xi,...,x„)ev = 7^ X! ep(a;iai^ + . . . + a;„a„^) 

for all (xi, . . . , Xn) S i^", where p = \F\ and Cp is the primitive character ep{x) := 
g27rza;/p_ gy (|4.2p wc thus havc (by linearity of expectation and by independence) 

P{X eV)=-E{lxev) 

= E n^pUi^ai^ + ■■■ + vi'^ci^O) 



1 " 



In particular we have 

1 " 

P(^ e < ^ E n I cos(2™,C/p)| 



1 " 

I I «GJ=^i=i 
1 " 1 1 

=MEn(o+o^«^(2-«.^/^'))' 



/2 

i^l ^ -'-'-'2 ' 2 

where the first identity follows from the substitution ^ i— > ^/2 on F, noting that the 
quantity | cos(7raj^/p)| is still periodic in ^ with period p. Arguing similarly with 
Y, we have 



1 " 
1 " 

= M E n((l " ^) + MCos(27ra,C/p))- 



' ' ?e-Fj=i 

To summarize, if we introduce the non-negative quantities 

(7.1) fiO ■■= 11(2 + 2^°'^^™^'^/^))'^'' ^(^^ ^= n((l-A') + Aicos(2^a,^/p)) 
i=i j=i 
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then we have 

(7.2) p(xey)<^5]/(0; PiYeV)^-^Y.9iO- 



We now give a crucial comparison estimate between / and g. 
Lemma 7.1. For all ^ E F, we have /(O < g{£.y^*^- 

Proof. By (|7.ip sufRces to show that 

(1 + 1 cos 0)1/2 <((1 cos 

for any 6. Writing cos^ =1 — 22: for some < a; < 1 (the x — case being trivial), 
this can be rearranged to become 

log(l -x)- log(l - 0) ^ log(l - 2fix) - log(l - 0) 
X ~ 2fix 

But this follows from the concavity of the function log(l — x) on < a; < 1 and the 
fact (from (gH)) that < 2/i < 1. □ 



Since g{(,) is clearly bounded by 1, and since /i < 1/4 by (|4.ip . it follows that for 
every ^ 

(7.3) /(0<5(0, 
which when combined with (|7.2p gives 

P(^ eV)< P{Y £ V). 

We now refine this argument to exploit the additional (and critical) hypothesis (|4.3[) 
that 

P(^ eV)> eiP{Y e V). 

Let £2 > be a sufficiently small positive constant (compared to ei). Define the 
spectrum A C of {ai, . . . , a„} to be the set 

(7.4) A U e F : f{0 > £2} 

Note that A is symmetric around the origin: A = —A. Next we make the elementary 
observation that 

(7.5) 1 - 100||x|p < cos(27rx) < 1 - y^ll^^lP 

(say), where ||a;|| is the distance of x to the nearest integer. From (|7.ip we thus 
have (with room to spare) 



1 " 

and hence there is a constant C(e2) depending on £2 such that 
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(7.6) {Y,\\ar^/p\?f'^<C{e2) 
for all ^ e A. 

We now obtain some cardinality bounds on A and of the iterated sumsets fcA. 
Lemma 7.2. There is a constant C depending on eo,ei,e2 such that 

(7.7) C-i2"("-''±)|i^| < |A| < C2-("-''±)|F| 
Furthermore, for every integer k > A 

(7.8) |M|<(^+^2^)c2-("-^±)|F|. 

Remark 7.3. The lemma implies that A has a small doubling constant. At this 
point one could apply Theorem [621 to gain further control on A. While this can be 
done, it is more convenient for us to apply the inverse Fourier transform and work 
on a certain "dual" set to A, which then contains most of the Oj. A key point in 
(|7.8p . which we will exploit in the proof of Lemma [7^ below, is that the growth of 
constants is only polynomial in k rather than exponential. 

Since depends on eo, . . . , Q-i, it would sufhce to say that C depends only on eg. 



Proof. From (|4^ . we have 

But from Lemma (7.11 the definition of A, and the crucial fact that n < 1/4, we 
know that 



I I I I C^A 



l-4/i 
2 1 



Now we are going to make an essential use of the assumption that fi is less than 
1/4. Given this assumption, we can guarantee that the contribution of ^ outside 
the spectrum is negligible, by choosing £2 sufficiently compared to ei. This results 
in the following equalities 



E /(^) ^ Q(E /(^)) = Q(E 5(0) = e(|F|P(x e V)) = e(2''±-«|i^|), 

CGA ieF ieF 

where the constants in the 0() notation depend on eo,ei,e2. The bounds on |A| 
now follows directly from the fact that for any £, £ A, 62 </{£,)< 1- 
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It remains to prove (17. 8p . In view of Lemma [631 it suffices to show that there is a 
constant C such that 



|4A| < C|A|. 

Notice that if ^ S 4A, then by (|7.6p and the triangle inequahty we have 

n 

for some constant C{t2) depending on £2 (we abuse the notation a Uttle bit, as the 
two C(e2) are not necessarily the same). From ()7.5|) . (|7.ip we conclude a bound of 
the form 



/(O > c(e2) > 0. 

Thus 



|4A| < c(e2)-^ E /(O < cXe^r' ^ <?(0 = e(2'^±-"|f^|) = e(|A|) 
concluding the proof. 

□ 

We now pass from control of the spectrum back to control on {ai, . . . , a„}, using 
the inverse Fourier transform. For any x ^ F, define the norm ||a;||A by 

It is easy to see that this is a quantity between and 1 which obeys the triangle 
inequality ||a; + ?/|jA < ||a;||A + H^Ha. From the triangle inequality again we have 

= 2(^Eii</p||')^/'- 

Thus by square-summing (|7.6p for all ^ € A, we obtain 

71 

(7.9) Eii«^-IIa^^(^2). 
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Thus we expect many of the aj to have smah A norm. On the other hand, we are 
going to show that the set of elements with small A norm has constant doubling 
number, thanks to the following lemma. 

Lemma 7.4. There is a constant C such that the following holds. Let A (- F 
denote the "Bohr set" 

A:^\{xeF:\\xU<^}\. 

Then we have 



Proof. Let us first establish the upper bound. We introduce the normalized Fourier 
transform of A: 

Kx) ■■= 777 

If a; e A + A, then we have ||x||a < ^ by the triangle inequality. Using (|7.5p . we 
conclude 



\h{x)\^ = m{x)h{x) 

= E cos(27ra;(e - e)/p) 

= i-m\x\\i 



As II a; II A < it follows that 



\h{x)\ > Vl - 100(1/50)2 > 1/2. 



On the other hand, from the Parseval identity we have 
(7.10) 5]|M2:)|' = |^^|/|A|; 

combining these two estimates we obtain 

|A + A| <4|i^|/|A| 
and the claim now follows from (17.71). 



Since the bound \A\ < \A + A\ is trivial, it now suffices to prove the lower bound 
on A. This proof has two parts. First we show that if \h{x)\ is large (close to 1), 
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then ||a;||A is small (close to zero). Next, we prove that there are many x such that 
\h{x)\ is large. 

The first part is simple and similar to the above argument. Using the fact that 
||2;||a < and (|7.5|) . we have 

\h{x)\'^ = m{x)h(xj 

= hHI E cos(27rx(e - C)/P) 

E i-i^Hc-eW 

= 1_ ^||^||2_ 

lOO" 

It follows that if \h{x)\ > 1 - 10"'* then x e A. 

Now we are going to show that the set of those x such that \h{x)\ > 1 — 10"'* has 
cardinality 0(2""'^±). This proof is trickier and we need to adapt an argument from 
[7] (see also 0), which requires the full strength of ()7.8p . Let fc be a large integer 
to be chosen later, and for each ^ e -F, let rk{£,) be the number of representations 
of the form ^ = + . . . + where , . . . , e A. Clearly we have 

CefeA 

and so by Cauchy-Schwarz 

j2\^km'>\M"'/\kA\- 

By Plancherel we have 

Ei^fc(^)i' = MEiE^'«(^)^f(^^)i'- 

But we have 

iE''fc(^)^f(<)i = i E ep(x(a + ... + 6)i 

ieF 4i,...,«fceA 

and hence 

E > l^l/I^A|. 

xeF 

Combining this with (|7.10p we obtain 

E > l^l/2|fcA|; 

a:eF:|/i(2:)|>(|A|/2|fcA|)i/(2fc-2) 
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since h{x) is trivially bounded by 1, we thus conclude 

|{.T G F : \h{x)\ > (|A|/2|fcA|)i/(2'=-2)}| > \F\/2\kA\. 
Inserting the bounds (|7.7p . (|7.8p we have 

for some constant C. Choose k sufficiently large (say k = + lO^*') we can 
guarantee that the right hand side is at least 1 — 10^"*. On the other hand k is still 
a constant, so \kA\ = 0(2'^±-"|F|). It follows that 

\{x£F: \h{x)\ > 1 - 10-4}| ^ e(2"-''±), 
proving the lower bound on A. □ 

8. Proof of Structure Theorem 

We are now in position to prove the Structure Theorem. In this section we allow the 
constants in the 0{) notation to depend on eg, ei, 62- From Lemma 17.41 we already 
know that A has doubling constant 0(1). Furthermore, it is easy to show that A 
contains most of the a^. Indeed, if is not in A then its A-norm is at least 1/100; 
(|7.9p then shows that 

\{1 <i <n : Gi A}\ = 0(1). 

We now apply Theorem 16. 21 to A, to conclude that there is a generalized arithmetic 
progression P of rank 0(1) containing A and with \A\ — 0(|P|) = 8(2"~''±). By 
Lemma 16.31 we can assume that P is proper (note that every non-zero element of 
F has order ^ p> ti"/^, which is certainly much larger than |^|). 

This progression P is close to, but not quite, what we need for the structure the- 
orem. Thus we shall now perform a number of minor alterations to P in order 
to make it exactly match the conclusions of the structure theorem. Each of our 
alterations will preserve the following properties claimed in the structure theorem, 
namely 

(8.1) rank(P) = 0(1) and 

(8.2) |P| = Mi...Af^ = 0(2"-''±), 

though of course the 0() constants may worsen with each of the alterations. 

First, we can include the elements of {ai, . . . , a„}\A in P simply by adding each 
exceptional aj as a new basis vector v'^ (with the corresponding length Afj set equal 
to 3). This enlargement of P (which by abuse of notation we shall continue to call 
P) now contains A and all of the aj , and obeys the bounds (|8.ip , (|8.2p with slightly 
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worse constants. Notice that by Lemma 16.31 we can always assume P is proper, 
without affecting the vahdity of (|8.1|) and ()8.2|) other than in the constants. 

It now remains to verify the estimate (jS.Sp on the P-norm, as well as the rational 
commensurability claim. 

To verify (|5.5p . consider the sum ||aj||p. For the exceptional aj (which were 
added to P afterwards) ||aj||p — 0{1). However, the number of these aj is 0{1). 
For the remaining aj, notice that P contains kaj for all k < xooyfSTUX- Thus, (|5.5|) 
follows from (17. 9|) which asserts that J^j II'^jIIa = '^(1)- This leaves the verification 
of the rational commensurability as the only remaining task. 

Let us, as usual, write 

P = {mivi H h mrVr \ - Mj/2 < rUj < Mj/2}. 

We consider P together with the map $ : P — > R*^ which maps miVi + • • ■+mrVr to 
(mi, . . . , TUr). Since P is proper, this map is bijective. We set U ;= {ai, . . . , a„} C 
P. 

We know that the progression P contains U, but we do not know yet that U "spans" 
P in the sense that the set $([/) has full rank (i.e., it spans R""). However, this 
is easily rectified by a rank reduction argument. Suppose that ^(U) did not have 
full rank, we are going to produce a new proper generalized arithmetic progression 
P' which still contains U and satisfies (|8.2p but has rank strictly smaller than the 
rank of P. Since the rank of P is 0(1), the process must terminate after at most 
0(1) iterations. 

We are going to produce P' as follows. If $([/) does not have full rank, then it is con- 
tained in a hyperplane of R*^. In other words, there exist integers ai, . . . , whose 
greatest common divisor is one and aimi + . . . + artrir = for all (mi, . . . , m^) € 
^{U). Let w be an arbitrary element of F and replace the each basis vector Vj in 
P by Vj — ajw. The new progression P' will continue to contain U, since we have 

mi(ui — aiw) + . . . + rririvr — a^Wr) = mi^i + . . . + m^w^ 

for all (mi, . . . ,mr) G ^(U). We can assume, without loss of generality, that ar is 
not divisible by p, the characteristic of F (at least one of the aj must be so). Select 
w so that 



Vr — arW — 0. 

This shows that P' has rank r — 1. Moreover, its volume is Mi, . . . M^-i which is 
at most the volume of P. We can use Lemma [6.31 to guarantee that P' is proper 
without increasing the rank. Furthermore, it is easy to see that (|5.5p is still valid 
with respect to P'. The argument is completed. Thus, from now on we can assume 
that ^{U) spans R'^. This information will be useful later on. 
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Define a highly rational number to be a number (in Q or F) which is of the form 
a/b where a,b are integers such that a,b = n°("^ and b ^ 0. Define a highly 
rational linear comhination of vectors wi , . . . , Wfe to be any expression of the form 
qiWi + . . . + qkWk where gi, . . . , (7^ are highly rational. 

We say that a set W (of vectors) economically spans a set U (of vectors) if there are 
numbers a,b = n°^"' such that each u € U can be represented as a highly rational 
linear combination of vectors in W where the numerators and denominators of the 
coefficient are uniformly bounded from above by a and b, respectively. 

The moral of our arguments below is the following: with respect to a bounded 
number of operations, any linear algebraic statement which holds for "span" also 
holds for "economically span" . This relies on the fact that any algebraic expression 
involving at most 0{1) highly rational numbers will again produce a highly rational 
number. In other words, the highly rational numbers behave heuristically like a 
subfield of F. 

Let us now turn to the details. First we make some simple remarks about the notion 
of economically spanning. It is a transitive in the sense that if W economically 
spans U and U economically span T, then W economically spans T (but with 
slightly worse o(n) constants). Furthermore, it is closed under union, in the sense 
that if W economically spans X and Y, then it economically spans the union of X 
and Y. Of course we are going to use these rules only a bounded number 0(1) of 
times. 

Consider the set of vectors U := {ai,...,a„}. We know that there is a set of 
vectors of cardinality r which economically span U, namely {vi, . . . ,Vr}. Now we 
are going to make use of the fact that $(?7) spans R'". Since each vector in <&([/) 
has coordinates at most maxi<i<„ Mi < 2" = and r = 0(1), it follows from 

Cramer's rule that ^{U) economically spans the basic vectors {ei, . . . , e^.}. As the 
map $ is bijective (P is proper), this means U economically spans {vi, . . . ,Vr}. 

We now claim that in fact there is a single element Vi which economically spans U. 
This clearly implies rational commensurability. 

Let s be the smallest number such that there is a subset of size s of wi, ...Vr eco- 
nomically spanning U. Without loss of generality, we assume that vi,...,Vs spans 
U and write 



where c,j are highly rational. Let us now consider two cases: 

• The matrix C = (cjj) has rank one. In this case the numbers aj/aj arc 
highly rational (as all the Cy are). But $(C/) has full rank in and 
economically spans Vi,..,Vr, so Vi/vj are also highly rational. This means 



s 



(8.3) 
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that vi (or any of the Vi, for that matter) economically spans wi, . . . ,fr, 
and hence also U by transitivity. 
• The matrix C has rank larger than one. Recall that (ai, .., a„) is the normal 
vector of a hyperplane spanned by ±1 vectors. Thus, there is a ±1 vector w 
which is orthogonal to a = (oi, ...a„), but not orthogonal to C. Express the 
inner product of a and w as a linear combination oi vi, . . . ,Vs using (|8.3p . 
Observe that all of the coefficients are highly rational. Moreover, the value 
of the combination is 0, but there is at least one non-zero coefficient. This 
implies that wc can express one of wi , . . . , as a linear combination of the 
others with highly rational coefficients and this contradicts the minimality 
of s. 

The proof of the structure theorem is complete. ■ 



9. Appendix: Proof of Lemma [F751 

In this section we give a proof of Lemma 16.31 This result appears to be "folklore" , 
being discovered independently by Gowers and Walters (private communication) 
and Ruzsa (private communication), but does not appear to be explicitly in the 
literature. 

We first need some notation and tools from the geometry of numbers. The first 
tool is the following theorem of Mahler on bases of lattices, which is a variant of 
Minkowski's second theorem (and is in fact proven using this theorem). 

Lemma 9.1. Let T be a lattice of full rank in R'* (i.e. a discrete additive sub- 
group of R with d linearly independent generators). Then there exists linearly 
independent vectors wi, . . . , S F which generate T, and such that 

(9.1) \wi\... \wd\ < (Cd)^Ves(R''/r) 

where mes(R'^/r) is the volume of a fundamental domain of T and C > is an 
absolute constant. 

Proof. (Sketch) Note that the standard Minkowski basis vi,. . . ,Vd of F (with re- 
spect to the unit ball Bd) will obey (|9.ip but need not generate F (consider for 
instance the lattice generated by Z'' and (i, . . . , i) for dimensions d > 5). How- 
ever, a simple algorithm of Mahler gives a genuine basis wi, . . . ,Wd of F such that 
Wj = tj^ivi + . . . + tjjVj for some real scalars \tj^i\ < 1, which thus also obeys (|9.ip 
with some degradation in the {Cd)'-''^ factor. See, for instance [21 Chapter 8] for 
more details. □ 

If d > 1 is an integer, and AI = (Mi, . . . , Md) and N = {Ni, . . . , Nd) are elements 
of R*^ such that Mi < Ni for all 1 < « < d, we use [M, N] to denote the discrete 
box 

[M, N] := {(ni, . . . , Ud) € Z'' : M, < n, < N, for aU 1 < i < d}. 
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Also, if G is an additive group, n — (ni, . . . , Ud) G 7/^ , and v — (vi, . . . , Vd) G G"^, 
we use n-vCz G to denote the quantity n-v — nivi + . . . + naVd, where of course we 
use nx := a; + . . . + X to denote the n-fold sum of x (with the obvious modification 
if n is negative). We also use [M, TV] • u C G to denote the set [Af, N] ■ v := {n ■ v : 
n e [M, N\]. Note that a progression of rank d in G is nothing more than a set of 
the form a + [0, N] ■ v for some a £ G, some d-tuple N — {Ni, . . . , Nd) G Z'J_, some 
other c?-tuple v = {vi, . . . , Vd) G G'^. 

We now give a "discrete John's theorem" which shows that the intersection of a 
convex symmetric body (i.e. a bounded open convex symmetric subset of R"*) and 
a lattice of full rank is essentially equivalent to a progression. 

Lemma 9.2 (Discrete John's theorem). Let B be a convex symmetric body in TC^, 
and let T be a lattice in R'' of full rank. Then there exists a d-tuple 

w = {wi, . . . ,Wd) eV^ 

of linearly independent vectors in T and and a d-tuple N = (iVi, . . . , Nd) of positive 
integers such that 

(-7V, iv) • w c B n r c {-d^'^N, d'^'^N) ■ w 

where C is an absolute constant. 



Proof. We first observe using John's theorem \W\ and an invcrtiblc linear transfor- 
mation that we may assume without loss of generality that Bd Q B ^ d - Bd, where 
Bd is the unit ball in R''. We may assume d >2 since the claim is easy otherwise. 

Now let VU = {wi, . . . , vud) be as in Lemma 19.11 For each j, let Nj be the least 
integer greater than l/dlwjj. Then from the triangle inequality we see that jniwi + 
. . . + UdWdl < 1 whenever \nj\ < Nj, and hence {—N, N) ■ w is contained in Bd and 
hence in B. 

Now let x G -B n F. Since w generates F, we have x = niWi + . . . + UdWd for some 
integers ni, . . . , n^; since B C d ■ Bd, we have |a:;| < d. Applying Cramer's rule to 
solve for ni , . . . , nd and (|9.ip , we have 

. . \x Awi . . .Wj-i Awj+i Awd\ 
^ \vuiA...Awd\ 

< \x\\wi\...\wd\ 

~ \wj\\wi A ...Awd\ 

< dd^'^/lwjl 

< d^'-^Nj, 

and hence x G {-d^'^N, d'^'^N) ■ w as desired. □ 

We can now prove Lemma [6.31 which we restate here with slightly different notation. 

Lemma 9.3. There is a constant Go such that the following holds. Let P be a 
symmetric progression of rank d in a abelian group G, such that every non-zero 
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element of G has order at least dP"'^ \P\. Then there exists a symmetric proper 
progression Q of rank at most d containing P and 

|g|<d^°'^V|. 

Proof. This claim is analogous to the basic linear algebra statement that every 
linear space spanned by d vectors is equal to a linear space with a basis of at most 
d vectors. Recall that the proof of that linear algebra fact proceeds by a descent 
argument, showing that if the d spanning vectors were linearly dependent, then one 
could exploit that dependence to "drop rank" and span the same linear space with 
d—l vectors. 

We may assume that P has the form P = [-iV/2, N/2] -v for some N = {Ni,..., Nd) 
and V = {vi,...,Vd)- We induct on d. The case d = 1 is easy. Now suppose 
inductively that d > 2, and the claim has already been proven for d— 1 (for arbitrary 
groups G and arbitrary progressions P). Thanks to the induction hypothesis, we 
can assume that P is non-proper and all Ni are at least one. As P is not proper, 
there is n =^ n' e [-N/2, N/2] such that 

n ■ V = n' ■ V. 

Let To C Z"^ denote the lattice {m e Z'^ : m-v = 0}, then it follows that ron[-iV, N] 
contains at least one non-zero element, namely n' — n. 

Let m — (mi, . . . , TOd) be a non-zero irreducible clement of Tq D [—N,N] (i.e. 
m/n ^Tq for any integer n > 1). By definition 

(9.2) m ■ V = mi ■ vi + . . . + rud ■ Vd = 0. 

We now claim that m is irreducible in Z'^ (i.e. that mi, . . . ,md have no common 
divisor). Indeed, if m factored as m = nfh for some n > 1 and m S Z'', then m • v 
would be a non-zero element of G of order n < \P\, contradicting the hypothesis. 

Our plan is to contain P inside a symmetric progression Q of rank d—l and 
cardinality 

(9.3) |g|<d^'V|. 

If we can achieve this, then by the induction hypothesis we can contain Q inside a 
proper symmetric progression R of rank at most d—l and cardinality 

\R\ < {d-lf°'^'^-'^^\Cdf'^'\P\. 

If Co is sufficiently large (but still independent of d) , then the right-hand side is 
at most |p|^ g^j^^j -^^g have closed the induction hypothesis. 

It remains to cover P by a symmetric progression of rank at most d—l with the 
bound (|9.3p . Observe that m lies in [—N, N], so the rational numbers 



mi/iVi, . . . ,md/Nd 
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lie between —1 and 1. Without loss of generality we may assume that rud/Nd has 
the largest magnitude, namely 

(9.4) \md\/Nd>\m,\/N, 

for all 1 < i < d. By replacing Vd with —Vd if necessary, we may also assume that 
fUd is positive. 

To exploit the cancellation in (|9.2p we introduce the rational vector q e ^ • 7/^^^ 
by the formula 

mi TOd-i, 

q := . 

rrid nid 

Since gcd(r7ii, . . . , md) — 1, we see that for any integer n, that n ■ q lies in Z''^^ if 
and only if n is a multiple of md- 

Next, let r C R''^^ denote the lattice T := Z''^^ + Z ■ q. Since q is rational, this is 
indeed a full rank lattice. We define the homomorphism / : F ^ Z by the formula 

/((ni, . . . ,nd^i) + Udq) := (rti, . . . ,7id) • w. 

the condition (19. 2p ensures that this homomorphism is indeed well defined, in the 
sense that different representations w = (ni, . . . ,rtd_i) + Udq of the same vector 
ui S r give the same value of f{w). We also let B C R denote the convex 
symmetric body 

B {(ii, . . . , td-i) e R"^"^ : -3iVj < tj < 3Nj for aU 1 < j < d - 1}. 

We now claim the inclusions 

P C /(B n r) C 5P - 5P. 

To see the first inclusion, let n • w G P for some n S [0, A^], then we have n ■ v = 
/((m, . . . , nd-i)+ndq)- From ()9.4I) we see that the j*'' co-efficient of (ni, . . . , nd-i)+ 
Udq has magnitude at most 3Nj, and thus n-v lies in /(PnF) as claimed. To see the 
second inclusion, let (rii, . . . , n^-i) + ndq be an element of P H F. By subtracting 
an integer multiple of md from rid if necessary (and thus adding integer multiples 
of mi, . . . , md-i to ni, . . . , n^-i) we may assume that \nd\ < \md\/2. By (j9.4p and 
the definition of P, this forces \nj\ < 5Nj for all I < j < d, and hence 

/((m, . . . , ud-i) + Udq) ^ (m, . . . , rid) • w C [-5iV, 5iV] • w = 5P - 5P. 

Next, we apply Theorem 19.21 to find vectors Wi, . . . , w^-i G T and Afi, . . . , Md_i 
such that 

(-M, Af) • w C P n F C {-(f'^M, (f'^M) ■ w. 

Applying the homomorphism /, we obtain 

(-A/, M) • /(w) C /(P n F) C {-(f'^M, (f'^M) ■ f{w) 

where /(w) := (/(wi), . . . , /(wd-i)- Observe that {-(f'^M,dP'^M) ■ f{w) is a 
symmetric progression of rank d — 1 which contains /(P ft F) and hence contains 
P. Also, since {-d^'^M, d^'^M) ■ f{w) can be covered by 0{{Cd)^'^^) translates of 
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(— M, AI) ■ /(w), and similarly 5P — 5P can be covered by 0{C'^) translates of P, 
we have 

\i-d^'^M,d^'^M)- f{w)\ < {Cdf'^'\f{BnT)\ 

< iCdf'^'\5P-5P\ 

< [Cdf-^'^C^lPl 

which proves (|9.3p . This completes the induction and proves the theorem. □ 

Remark 9.4. It is not hard to remove the hypothesis that G is "nearly torsion free" 
in the sense that the order of every non-zero element is much larger than P, however 
one must then drop the conclusion that Q is symmetric (this can be seen simply 
by considering the case G — Z/2Z). 



References 

[1] Y. Bilu, Structure of sets with small sumset. Structure theory of set addition. Astrisque No. 
258 (1999), xi, 77-108. 

[2] Y. Bilu, V. Lev, I. Ruzsa, Rectification principles in additive number theory. Discrete Comput. 

Geom. 19 (1998), 343-353. 
[3] J. W.S. Cassels, An introduction to the geometry of numbers. Springer, Berlin, 1959. 
[4] M. Chang, A polynomial bound in Freiman's theorem, Duke Math. J. 113 (2002), no. 3, 

399-419. 

[5] P. Erdos, On a lemma of Littlewood and Offord, Bull. Amer. Math. Soc. 51 (1945), 898-902. 
[6] G. Freiman, Foundations of a structural theory of set addition. Translated from the Rus- 
sian. Translations of Mathematical Monographs, Vol 37. American Mathematical Society, 
Providence, R. I., 1973. vii+108 pp. 
[7] B. Green, I. Ruzsa, Sets with small sumset and rectification, preprint. 
[8] B. Green, I. Ruzsa, Freiman's theorem in an arbitrary abelian group, preprint. 
[9] G. Halasz, Estimates for the concentration function of combinatorial number theory and 
probability. Period. Math. Hungar. 8 (1977), no. 3-4, 197-211. 
[10] F. John, Extremum problems with inequalities as subsidiary conditions. Studies and Essays 
presented to R. Courant on his 60th birthday, Jan 8. 1948, Interscience Publishers Inc., New 
York, NY 1948, 187-204. 
[11] J. Kahn, J. Komlos, E. Szemeredi, On the probability that a random ±1 matrix is singular, 

J. Amer. Math. Soc. 8 (1995), 223-240. 
[12] J. Komlos, On the determinant of (0, 1) matrices, Studia Sci. Math. Hungar. 2 (1967) 7-22. 
[13] J. Komlos, On the determinant of random matrices, Studia Sci. Math. Hungar. 3 (1968) 
387-399. 

[14] A. Odlyzko, On subspaces spanned by random selections of ±1 vectors, J. Combin. Theory 

Ser. A 47 (1988), no. 1, 124-133. 
[15] I. Ruzsa, Generalized arithmetical progressions and sumsets. Acta Math. Hungar. 65 (1994), 

no. 4, 379-388. 

[16] I. Ruzsa, An analog of Freiman's theorem in groups. Structure theory of set addition, 

Asterisque No. 258 (1999), 323-326. 
[17] T. Tao and V. Vu, On random ±1 matrices: Singularity and Determinant, Random Structures 

and Algorithms 28 (2006), no 1, 1-23. 



TERENCE TAO AND VAN VU 



Abstract. Let n be a large integer and M„ be a random n by n matrix whose 
entries are i.i.d. Bernoulli random variables (each entry is ±1 with probability 
1/2). We show that the probability that M„ is singular is at most (3/4+o(l))" , 
improving an earlier estimate of Kahn, Komlos and Szemeredi 1111 . as well as 
earlier work by the authors ri7| . The key new ingredient is the applications of 
Freiman type inverse theorems and other tools from additive combinatorics. 



